Abstract. Evolutionary algorithms are popular heuristics for solving
various combinatorial problems as they are easy to apply and often pro- 
duce good results. Island models parallelize evolution by using different 
populations, called islands, which are connected by a graph structure as 
communication topology. Each island periodically communicates copies 
of good solutions to neighboring islands in a process called migration. 
We consider the speedup gained by island models in terms of the parallel 
running time for problems from combinatorial optimization: sorting (as 
maximization of sortedness), shortest paths, and Eulerian cycles. Differ- 
ent search operators are considered. The results show in which settings 
and up to what degree evolutionary algorithms can be parallelized effi- 
ciently. Along the way, we also investigate how island models deal with 
plateaus. In particular, we show that natural settings lead to exponential 
vs. logarithmic speedups, depending on the frequency of migration. 



1 Introduction 

Evolutionary algorithms (EAs) are popular heuristics for various combinatorial 
problems as they often perform better than problem-specific algorithms with 
proven performance guarantees. They are easy to apply, even in cases where the 
problem is not well understood or when there is not enough time or expertise 
to design a problem-specific algorithm. Another advantage is that EAs can be 
parallelized easily [17]. This is becoming more and more important, given the
development in computer architecture and the rising number of CPU cores.
Developing efficient parallel metaheuristics is a very active research area [1,16].

A simple way of using parallelization is to use so-called offspring populations: 
new solutions (offspring) are created and evaluated simultaneously on different 
processors. Island models use parallelization on a higher level. Subpopulations, 
called islands, which are connected by a graph structure, evolve independently 
for some time and periodically communicate copies of good solutions to neigh- 
boring islands in a process called migration. Migration is typically performed 
every τ iterations; the parameter τ is known as the migration interval. A slow
spread of information typically yields a larger diversity in the system, which can
help for optimizing multimodal problems. For other problems a rapid spread of
information (like setting τ = 1 and migrating in every iteration) is beneficial,
assuming low communication costs [S].

Despite widespread applications and a long history of parallel EAs, the theory
of these algorithms is lagging far behind. Present theoretical work only concerns
the study of the spread of information, or takeover time, in isolated and
strongly simplified models (see, e.g., [13]) as well as investigations of island
models on constructed test functions [7,8,10]. It is agreed that more fundamental
research is needed to understand when and how parallel EAs are effective [IS].

In this work we consider the speedup gained by parallelization in island
models on illustrative problems from combinatorial optimization. The question
is to what extent using μ islands (each running an EA synchronously and in parallel)
can decrease the number of iterations until a global optimum is found, compared
to a single island. The number of iterations for such a parallel process is called
the parallel optimization time. If the expected parallel optimization time is by a
factor of Θ(μ) smaller than the expected time for a single island, we speak of
an (asymptotic) linear speedup. A linear speedup implies that a parallel and a
sequential algorithm have the same total computational effort, but the parallel
time for the former is smaller. We are particularly interested in the range of μ
for which a linear speedup can be guaranteed. This degree of parallelizability
depends on the problem, the EA running on the islands, and the parameters
of the island model. Our investigation gives answers to the question of how many
islands should be used in order to achieve a reasonable speedup. Furthermore, it
sheds light on the impact of design choices such as the communication topology
and the migration interval τ, as the speedup may depend heavily on these aspects.

Following previous research on non-parallel EAs [12], we consider various
well-understood problems from combinatorial optimization: sorting as an
optimization problem [14] (Section 4), the single-source shortest path problem [2,5]
(Section 5), and the Eulerian cycle problem [3,4,11] (Section 6). As in previous
studies, the purpose is not to design more efficient algorithms for well-known 
problems. Instead, the goal is to understand how general-purpose heuristics per- 
form when being applied to a broad range of problems. The chosen problems 
contain problem features that are also present in more difficult, NP-hard prob- 
lems. In particular, the Eulerian cycle problem contains so-called plateaus, that 
is, regions of the search space with equal objective function values. Our investi- 
gations pave the way for further studies that may include NP-hard problems. 

For the sake of readability, some proofs are put in an appendix. 

2 Preliminaries 

Island models evolve separate subpopulations, called islands, independently for some
time. Every τ generations, at the end of an iteration (or generation, in the common
language of EAs), copies of selected search points or individuals are sent as
migrants to neighbored islands. Depending on their objective value f, or fitness,
migrants are included in the target island's population after selection. The neighborhood
of the islands is defined by a topology, a directed graph with the islands as nodes.

Algorithm 1 presents a general island model, formulated for maximization.
As in many previous studies for combinatorial optimization [12], we consider
Algorithm 1 Island model

Let t := 0. For all 1 ≤ i ≤ μ initialize population P_0^i uniformly at random.
repeat
    For all 1 ≤ i ≤ μ do in parallel
        Choose x^i ∈ P_t^i uniformly at random.
        Create y^i by mutation of x^i.
        Choose z^i ∈ P_t^i with minimum fitness in P_t^i.
        if f(y^i) ≥ f(z^i) then P_{t+1}^i = P_t^i \ {z^i} ∪ {y^i} else P_{t+1}^i = P_t^i.
        if t mod τ = 0 and t > 0 then
            Migrate copies of an individual with maximum fitness in P_{t+1}^i
            to all neighbored islands.
            Let y^i be of maximum fitness among immigrants.
            Choose z^i ∈ P_{t+1}^i with minimum fitness in P_{t+1}^i.
            if f(y^i) > f(z^i) then P_{t+1}^i = P_{t+1}^i \ {z^i} ∪ {y^i}.
    Let t = t + 1.


islands of population size only 1, running variants of the (1+1) EA or randomized 
local search (RLS). Both maintain a single search point and create a new search 
point in each generation by applying a mutation operator. This offspring replaces 
the current solution if its fitness is not worse. RLS uses local operators for 
mutation, while the (1+1) EA uses a stochastic neighborhood [T^]. For the
μ-vertex complete topology K_μ, the island model then basically equals what is
known as the (1+μ) EA or (1+μ) RLS, respectively, if we migrate in every generation
(τ = 1): the best of μ offspring competes with the parent as in the (1+1) EA.

We consider different topologies to account for different physical architectures
and assume that the communication costs on the physical topology are so low
that we can focus on the parallel optimization time only. Unless specified
otherwise, we assume τ = 1.
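
The island model with islands of size 1 can be sketched in a few lines of Python. This is only an illustrative sketch under assumptions, not the authors' implementation: the function names `island_model` and `ring`, the callback parameters (`init`, `mutate`, `fitness`, `is_optimal`) and the cap `max_gens` are hypothetical and chosen for this example. Offspring are accepted if not worse, immigrants only if strictly better, as in Algorithm 1.

```python
import random

def ring(mu):
    """Unidirectional ring: island i sends migrants to island (i + 1) mod mu."""
    return [[(i + 1) % mu] for i in range(mu)]

def island_model(mu, neighbors, init, mutate, fitness, is_optimal,
                 tau=1, max_gens=10**6, rng=None):
    """Island model with one individual per island (maximization).

    neighbors[i] lists the islands that island i sends migrants to.
    Returns the number of generations until some island holds an optimum,
    or None if max_gens (a safety cap assumed for this sketch) is reached.
    """
    rng = rng or random.Random()
    pop = [init(rng) for _ in range(mu)]
    for t in range(max_gens):
        if any(is_optimal(x) for x in pop):
            return t
        # Mutation and selection on every island: accept offspring if not worse.
        nxt = []
        for x in pop:
            y = mutate(x, rng)
            nxt.append(y if fitness(y) >= fitness(x) else x)
        # Migration every tau generations (never in generation 0).
        if t > 0 and t % tau == 0:
            inbox = [[] for _ in range(mu)]
            for i in range(mu):
                for j in neighbors[i]:
                    inbox[j].append(nxt[i])
            for j in range(mu):
                if inbox[j]:
                    best = max(inbox[j], key=fitness)
                    # Only strictly better immigrants replace the resident.
                    if fitness(best) > fitness(nxt[j]):
                        nxt[j] = best
        pop = nxt
    return None
```

Passing a complete graph as `neighbors` together with `tau=1` gives the behavior described above for the (1+μ) EA or (1+μ) RLS, depending on the mutation operator supplied.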

3 Previous Work 

The authors [9,10] presented general bounds for parallel EAs by generalizing the
fitness-level method or method of f-based partitions (see Wegener [18]). The idea
of the method is to divide the search space into sets A_1, ..., A_m strictly ordered
w. r. t. fitness: A_1 <_f A_2 <_f ... <_f A_m where A <_f B iff f(a) < f(b) for every
a ∈ A, b ∈ B. In addition, A_m contains only global optima.

We say that a population-based algorithm (including populations of size 1) is
in A_i or on level i if the current best individual in the population is in A_i. Elitist
algorithms (defined as algorithms where the best solution in the population never
worsens) can only increase the current level. The goal is to reach A_m. If s_i is a
lower bound on the probability of leaving A_i towards any higher fitness level in
one generation, the expected waiting time is at most 1/s_i. As every level has to
be left at most once, the expected optimization time is at most $\sum_{i=1}^{m-1} 1/s_i$.

The authors [3] generalized this method for island models that run elitist 
islands, for commonly used topologies. If migration is used in every generation, 
information about the current best fitness level is propagated to neighbored 
islands. This increases the number of islands searching for better fitness levels
in parallel. The following theorem summarizes (a refinement of) our results. 

Theorem 1. Consider an island model with μ islands where each island runs
an elitist EA. In every iteration each island sends copies of its best individual to
all neighbored islands (i. e. τ = 1). Each island incorporates the best out of its
own individuals and its immigrants.

For every partition A_1 <_f ... <_f A_m, if s_i is a lower bound for the probability
that in one generation an island in A_i finds a search point in A_{i+1} ∪ ... ∪ A_m,
then the expected parallel optimization time is bounded by

1. $2 \sum_{i=1}^{m-1} \frac{1}{s_i^{1/2}} + \frac{1}{\mu} \sum_{i=1}^{m-1} \frac{1}{s_i}$ for every unidirectional ring (a ring with edges in
   one direction) or any other strongly connected topology,

2. $3 \sum_{i=1}^{m-1} \frac{1}{s_i^{1/3}} + \frac{1}{\mu} \sum_{i=1}^{m-1} \frac{1}{s_i}$ for every undirected grid or torus graph with side
   lengths at least $\sqrt{\mu} \times \sqrt{\mu}$,

3. $m + \frac{1}{\mu} \sum_{i=1}^{m-1} \frac{1}{s_i}$ for the complete topology K_μ.

Assuming the fitness-level bound $\sum_{i=1}^{m-1} 1/s_i$ for the time of a single island is
asymptotically tight, all three bounds yield an asymptotic linear speedup in case
the first summands are each of at most the same order as the second summand.

Apart from the different constants 2 and 3, denser topologies yield better
upper bounds than sparse ones. This makes sense as with the fitness-level
argumentation a rapid spread of information gives the best estimates for the time
until an improvement is found. The motivation for studying sparse topologies is that
they have lower communication cost and they yield a larger diversity. An example
where this diversity is beneficial will be given in Section 6.
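
To make the three bounds of Theorem 1 concrete, the following Python sketch evaluates them for a given list of success probabilities. The function names and the dictionary keys are hypothetical, chosen for illustration; the formulas follow the theorem as stated, with `s` holding the lower bounds s_1, ..., s_{m-1}.

```python
def single_island_bound(s):
    """Fitness-level bound sum_i 1/s_i for one elitist island."""
    return sum(1.0 / si for si in s)

def island_bounds(s, mu):
    """Upper bounds of Theorem 1 for mu islands with migration in every generation.

    s holds lower bounds s_1, ..., s_{m-1} on the probabilities of leaving
    the non-optimal fitness levels; m = len(s) + 1 is the number of levels.
    """
    m = len(s) + 1
    shared = single_island_bound(s) / mu  # second summand, common to all bounds
    return {
        "ring": 2 * sum(si ** -0.5 for si in s) + shared,
        "torus": 3 * sum(si ** (-1.0 / 3.0) for si in s) + shared,
        "complete": m + shared,
    }
```

For instance, with four levels of success probability 1/4 each and μ = 4, the single-island bound is 16, while the ring bound is 2 · 8 + 4 = 20 and the complete-topology bound is 5 + 4 = 9. On such easy levels the first summand dominates, which illustrates why a linear speedup requires the first summand to be of at most the order of the second.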

4 Sorting 

We start our investigations with the first combinatorial problem for which EAs
have been analyzed. Scharnow, Tinnefeld, and Wegener [14] considered the classical
sorting problem as an optimization problem: given a sequence of n distinct
elements from a totally ordered set, sorting is the problem of maximizing sortedness.
W. l. o. g. the elements are 1, ..., n; then the aim is to find the permutation
π_opt such that (π_opt(1), ..., π_opt(n)) is the sorted sequence.

The search space is the set of all permutations π on 1, ..., n. Two different
operators are used for mutation. An exchange chooses two indices i ≠ j uniformly
at random from {1, ..., n} and exchanges the entries at positions i and j. A jump
chooses two indices in the same fashion. The entry at i is put at position j and
all entries in between are shifted accordingly. For instance, a jump with i = 2
and j = 5 would turn (1, 2, 3, 4, 5, 6) into (1, 3, 4, 5, 2, 6).
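
The two operators, and the mutation of the (1+1) EA that combines S + 1 of them with S drawn from a Poisson distribution with parameter 1, can be sketched as follows. The function names are hypothetical. Positions are 1-based as in the text, and the Poisson sampler uses Knuth's multiplication method because Python's standard library offers no Poisson generator; this is a choice made for the sketch only.

```python
import math
import random

def exchange(perm, i, j):
    """Swap the entries at positions i and j (1-based, as in the text)."""
    p = list(perm)
    p[i - 1], p[j - 1] = p[j - 1], p[i - 1]
    return tuple(p)

def jump(perm, i, j):
    """Move the entry at position i to position j, shifting the entries in between."""
    p = list(perm)
    entry = p.pop(i - 1)
    p.insert(j - 1, entry)
    return tuple(p)

def poisson1(rng):
    """Sample from a Poisson distribution with parameter 1 (Knuth's method)."""
    limit, k, prod = math.exp(-1.0), 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k

def mutate(perm, rng):
    """(1+1) EA mutation: S + 1 elementary operations, each an exchange or a jump."""
    p = tuple(perm)
    n = len(p)
    for _ in range(poisson1(rng) + 1):
        i, j = rng.sample(range(1, n + 1), 2)  # two distinct positions
        op = exchange if rng.random() < 0.5 else jump
        p = op(p, i, j)
    return p
```

Calling `jump((1, 2, 3, 4, 5, 6), 2, 5)` reproduces the example from the text and returns `(1, 3, 4, 5, 2, 6)`.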

The (1+1) EA draws S according to a Poisson distribution with parameter
λ = 1 and then performs S + 1 elementary operations. Each operation is either
an exchange or a jump, where the decision is made independently and uniformly
for each elementary operation. The resulting offspring replaces its parent if its
fitness is not worse.

Algorithm                         INV                     HAM, LAS, EXC
(1+1) EA                          O(n² log n) [14]        O(n² log n) [14]
island model on ring              O(n² + (n² log n)/μ)    O(n^{3/2} + (n² log n)/μ)
island model on torus             O(n² + (n² log n)/μ)    O(n^{4/3} + (n² log n)/μ)
island model on K_μ / (1+μ) EA    O(n² + (n² log n)/μ)    O(n + (n² log n)/μ)

Table 1. Upper bounds for expected parallel optimization times for the (1+1) EA and
the corresponding island model with μ islands for sorting n objects.

The fitness function f_{π_opt}(π) describes the sortedness of
(π(1), ..., π(n)). As in [14], we consider the following measures of sortedness:

- INV(π) measures the number of pairs 1 ≤ i < j ≤ n such that π(i) <
  π(j) (pairs in correct order),

- HAM(π) measures the number of indices i such that π(i) = i (elements at
  the correct position),

- LAS(π) equals the largest k such that π(i_1) < ... < π(i_k) for some i_1 <
  ... < i_k (length of the longest ascending subsequence),

- EXC(π) equals the minimal number of exchanges (of pairs π(i) and π(j)) to
  sort the sequence, leading to a minimization problem.
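
The four measures can be computed directly from their definitions. The sketch below is illustrative only; the function names are hypothetical. INV uses the quadratic pair count, LAS uses the standard patience-sorting technique with binary search, and EXC relies on the well-known fact that the minimal number of exchanges needed to sort a permutation equals n minus its number of cycles.

```python
from bisect import bisect_left

def inv(perm):
    """Number of pairs of positions a < b whose entries are in correct order."""
    n = len(perm)
    return sum(1 for a in range(n) for b in range(a + 1, n) if perm[a] < perm[b])

def ham(perm):
    """Number of elements at their correct (1-based) position."""
    return sum(1 for pos, v in enumerate(perm, start=1) if v == pos)

def las(perm):
    """Length of the longest ascending subsequence (patience sorting)."""
    tails = []  # tails[k] = smallest last entry of an ascending subsequence of length k + 1
    for v in perm:
        k = bisect_left(tails, v)
        if k == len(tails):
            tails.append(v)
        else:
            tails[k] = v
    return len(tails)

def exc(perm):
    """Minimal number of exchanges to sort: n minus the number of cycles."""
    n = len(perm)
    seen = [False] * n
    cycles = 0
    for start in range(n):
        if not seen[start]:
            cycles += 1
            k = start
            while not seen[k]:
                seen[k] = True
                k = perm[k] - 1  # follow the permutation (entries are 1-based)
    return n - cycles
```

For the permutation (1, 3, 4, 5, 2, 6) from the jump example, these functions give INV = 12, HAM = 2, LAS = 5 and EXC = 3; the sorted sequence of length 6 has INV = 15, HAM = 6, LAS = 6 and EXC = 0.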

The expected optimization time of the (1+1) EA is Ω(n²) and O(n² log n) for
all fitness functions. The upper bound is tight for LAS, and it is believed to be
tight for INV, HAM, and EXC as well [14]. Theorem 1 yields the following.

Theorem 2. The expected parallel optimization times of the (1+1) EA and the
corresponding island model with μ islands are as in Table 1.

For INV, all topologies guarantee a linear speedup only in case μ = O(log n) and
the bound O(n² log n) for the (1+1) EA is tight. The other functions allow for
linear speedups up to μ = O(n^{1/2} log n) (ring), μ = O(n^{2/3} log n) (torus), and
μ = O(n log n) (K_μ), respectively (again assuming tightness, otherwise up to a
factor of log n). Note how the results improve with the density of the topology.
HAM, LAS, and EXC yield much better guarantees for the island model than
INV, though there is no visible performance difference for a single (1+1) EA.



5 Shortest Paths 

We now consider parallel variants of the (1+1) EA for the single-source shortest
path problem (SSSP). Its complexity for the (1+1) EA was first considered
in [14]. An SSSP instance is given by an undirected connected graph with vertices
{1, ..., n} and a distance matrix D = (d_{ij})_{1 ≤ i,j ≤ n} where d_{ij} ∈ ℝ_0^+ ∪ {∞} defines
the length value for given edges from node i to node j. We are searching for
shortest paths from a node s (w. l. o. g. s = n) to each other node 1 ≤ i ≤ n-1.

A candidate solution is represented as a shortest paths tree, a tree rooted at s
with directed shortest paths to all other vertices. We define a search point x as a
vector of length n-1, where position i describes the predecessor node x_i of node i
in the shortest path tree. Note that infeasible solutions are possible in case the
predecessors do not encode a tree. An elementary mutation chooses a vertex i
uniformly at random and replaces its predecessor x_i by a vertex chosen uniformly
at random from {1, ..., n} \ {i, x_i}. We call this a vertex-based mutation. The
(1+1) EA creates an offspring using S elementary mutations, where S is chosen
according to a Poisson distribution with λ = 1.

Algorithm                  vertex-based mutation [14]               edge-based mutation [5]
(1+1) EA                   Θ(n² ℓ*)                                 Θ(m ℓ*) [5]
island model on ring       O(n^{3/2} ℓ^{1/2} + n² ℓ ln(en/ℓ)/μ)     O(m^{1/2} n^{1/2} ℓ^{1/2} + m ℓ ln(en/ℓ)/μ)
                           → μ = O((nℓ)^{1/2})                      → μ = O((m/n · ℓ)^{1/2})
island model on torus      O(n^{4/3} ℓ^{1/3} + n² ℓ ln(en/ℓ)/μ)     O(m^{1/3} n^{2/3} ℓ^{1/3} + m ℓ ln(en/ℓ)/μ)
                           → μ = O((nℓ)^{2/3})                      → μ = O((m/n · ℓ)^{2/3})
i. m. on K_μ / (1+μ) EA    O(n + n² ℓ ln(en/ℓ)/μ)                   O(n + m ℓ ln(en/ℓ)/μ)
                           → μ = O(nℓ)                              → μ = O(m/n · ℓ)

Table 2. Worst-case expected parallel optimization times for the (1+1) EA and the
corresponding island model with μ islands for the SSSP on graphs with n vertices and
m edges. The value ℓ is the maximum number of edges on any shortest path from the
source to any vertex and ℓ* := max{ℓ, ln n}. The second lines show a range of μ-values
yielding a linear speedup, apart from a factor ln(en/ℓ).

The fitness function is defined as follows: Let f(x) = (f_1(x), ..., f_{n-1}(x)),
where f_i(x) is the length of the path from s to i if it is described by x and
f_i(x) = ∞ otherwise. The function f(x) defines a partial order on the search
points: f(x) ≤ f(x') ⇔ f_i(x) ≤ f_i(x') for all i ∈ {1, 2, ..., n-1}. This defines
a multi-objective minimization problem, but there is exactly one Pareto-optimal
fitness vector. The multi-objective (1+1) EA chooses an initial search point x
uniformly at random and performs in each iteration a mutation step as described
above. The new search point x' is accepted if f(x') ≤ f(x).
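
The fitness vector, the acceptance test and an elementary vertex-based mutation can be sketched as follows. The names and the data layout are assumptions made for this illustration: vertices are numbered 1, ..., n with source s = n, the list `x` has length n with index 0 unused, and `dist[u][v]` holds the edge length or `math.inf` for a missing edge.

```python
import math
import random

def sssp_fitness(x, dist):
    """Return (f_1(x), ..., f_{n-1}(x)) for the predecessor vector x.

    x[i] is the predecessor of vertex i for i = 1..n-1 (x[0] is unused);
    the source is vertex n. A vertex whose predecessor chain runs into a
    cycle or uses a missing edge gets the value infinity.
    """
    n = len(x)
    f = []
    for i in range(1, n):
        total, v, steps = 0.0, i, 0
        while v != n and steps < n:  # more than n - 1 steps would mean a cycle
            u = x[v]
            total += dist[u][v]
            v, steps = u, steps + 1
        f.append(total if v == n else math.inf)
    return f

def accepts(f_new, f_old):
    """Acceptance rule f(x') <= f(x): no component may get worse."""
    return all(a <= b for a, b in zip(f_new, f_old))

def vertex_mutation(x, rng):
    """Pick a vertex i and give it a new predecessor from {1..n} without i and x_i."""
    n = len(x)
    y = list(x)
    i = rng.randrange(1, n)
    choices = [v for v in range(1, n + 1) if v != i and v != y[i]]
    y[i] = rng.choice(choices)
    return y
```

On a triangle with source 3, edge lengths d(3,1) = 1, d(1,2) = 1 and d(3,2) = 5, the predecessor vector (3, 1) has fitness (1, 2) and is accepted over (3, 3) with fitness (1, 5), while (2, 1) encodes a cycle and has fitness (∞, ∞).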

The expected parallel optimization time can be bounded as follows. Partition
the vertices into layers 1, ..., ℓ such that the j-th layer contains all vertices
having shortest paths of at most j edges. When shortest paths have been found
for all layers 1, ..., j, shortest paths for vertices in layer j + 1 can be found by
assigning the correct predecessor in a lucky mutation. The probability for making
an improvement is at least i/(en²), in case i vertices on layer j still need to find
the right predecessors [14]. Applying Theorem 1 to all layers and considering a
worst case for the arrangement of layers yields the following upper bounds.

Theorem 3. The expected parallel optimization times of the multi-objective
(1+1) EA and the corresponding island model with μ islands are bounded
according to the first column of Table 2.

The upper bounds for the island models with constant μ match the expected
time of the (1+1) EA in case ℓ = O(1) or ℓ = Ω(n) as then ℓ ln(en/ℓ) = Θ(ℓ*). In
other cases the upper bounds are off by a factor of ln(en/ℓ). Table 2 also shows
a range of μ-values for which the speedup is linear (if ℓ = O(1) or ℓ = Ω(n))
or almost linear, that is, when disregarding the ln(en/ℓ) term. Note how the
possible speedups significantly increase with the density of the topology.

Doerr and Johannsen [5] presented the following novel mutation operator.
Imagine predecessors to be represented by a set of edges such that for each
vertex v there is exactly one edge with end point v in the set. Each elementary
mutation consists of choosing an edge (u, v) of the graph uniformly at random,
adding it to the set, and removing the edge with end point v from the set. This
saves the (1+1) EA from assigning predecessors that are not connected to the
vertex and it decreases the expected running time of the (1+1) EA by a factor
of Θ(m/n²). By Lemma 3 in [5] the (lower bound for the) probability of making
an improvement is increased to i/(em). The resulting bounds for the (1+1) EA
using this mutation operator are shown in the second column of Table 2.
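
An elementary edge-based mutation can be sketched on a predecessor vector as follows. This is a hedged illustration, not the operator's reference implementation: the function name is hypothetical, and the sketch assumes that `directed_edges` lists every graph edge as an ordered pair (u, v) in each orientation whose end point v is not the source, so that choosing a pair means "make u the predecessor of v".

```python
import random

def edge_mutation(x, directed_edges, rng):
    """Choose an edge (u, v) uniformly at random and make it the predecessor edge of v.

    x[v] is the current predecessor of vertex v (x[0] is unused). Replacing
    x[v] by u corresponds to adding (u, v) to the edge set and removing the
    previous edge with end point v.
    """
    y = list(x)
    u, v = rng.choice(directed_edges)
    y[v] = u
    return y
```

Unlike the vertex-based operator, this never assigns a predecessor that is not adjacent to the vertex, which is the source of the improved success probability i/(em) quoted above.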

Note that the ranges for possible speedups are never greater than for
vertex-based mutations. This is because edge-based mutations are sometimes more
efficient and never worse than vertex-based mutations in the (1+1) EA.

6 Eulerian Cycles 

Given an undirected, loopless Eulerian graph, the task is to find an Eulerian
cycle, that is, a graph traversal where each edge is traversed exactly once. A
straightforward representation leads to plateaus, i. e., regions of equal fitness that
have to be overcome by an EA. The performance of EAs on Eulerian cycles has
been investigated in [3,4,6,11] where it has been shown that more sophisticated
operators and representations lead to increasingly better performance.

Neumann [11] suggested a representation motivated by Hierholzer's algorithm.
The idea of this algorithm is to subsequently concatenate cycles. This
gives a walk, that is, a sequence of edges. When the walk includes all edges of
the graph, an Eulerian cycle is created. Walks are represented by a permutation
of the edges of the graph. The length of a walk (e_1, e_2, ..., e_m) is the largest
integer ℓ such that for all 1 ≤ i ≤ ℓ-1 the edges e_i and e_{i+1} share a vertex. So,
it is the length of a partial Euler walk. The first and last vertices of e_1 and e_ℓ are
called start and end of the walk, resp. Neumann [11] as well as Doerr, Hebbinghaus,
and Neumann [3] consider the length of the current walk as fitness and use
jumps as mutation operators for RLS and the (1+1) EA. RLS always performs
one jump, while the (1+1) EA chooses the number of jumps as in Section 4.

With the edge walk representation, fitness can be increased by appending a
proper edge to the current walk. However, this operation is not always possible
in case the current walk has closed a cycle. To see this, Neumann [11] defined the
instance G' as the concatenation of two cycles C and C', each consisting of m/2
edges, that share one common vertex v*. This instance represents an asymptotic
worst case for the time until an improvement is found.

If the current walk coincides with C, say, the current walk can only be extended
by a single jump if it starts and ends with the vertex v*. If it does not,
the walk needs to be rotated until v* becomes start and end of the current walk.
Rotations can be done by a jump with parameters (1, m/2) or (m/2, 1). As the
fitness of all possible rotations of C is equal, the algorithm has to search on
a plateau. Since the two above jumps are equally likely, rotating C with RLS
corresponds to a fair random walk. With constant probability, the cycle needs
to be rotated by a distance of Θ(m). This takes an expected number of Θ(m²)
steps of the random walk. As only two out of m(m-1) possible jump operations
are accepted, waiting for accepted jumps yields an additional factor of Θ(m²).
The expected optimization time of both RLS and the (1+1) EA on G' is Θ(m⁴) [11].

Mutation operator         RLS             par. RLS, frequent migr.    par. RLS, rare migr.
Unrestricted              Θ(m⁴) [11,3]    Ω(m⁴/(log μ))               O(m³ + 3^{-μ} · m⁴)
Restricted, symmetric     Θ(m³)           Ω(m³/(log μ))               O(m² + 3^{-μ} · m³)
Restricted, asymmetric    O(m²)           O(m²)                       O(m²)

Table 3. Expected parallel optimization times for RLS and the island model running
RLS on μ = poly(m) islands with topology T for computing an Eulerian cycle on
G'. "Frequent migrations" is τ · diam(T) · μ = O(m²) for unrestricted jumps and
τ · diam(T) · μ = O(m) for symmetrically restricted ones, respectively. "Rare migrations"
is τ ≥ m³ and τ ≥ m², respectively.

G' is a simple and natural instance as it represents the key features of the
problem in a very clear way. It represents a worst case for a single fitness level.
It is not necessarily a global worst case as there is only one difficult fitness level,
leaving a gap of m to a general upper bound of O(m⁵) for all Eulerian graphs [11].
For simplicity, we focus on RLS instead of the (1+1) EA; here, both have equal
asymptotic performance anyway [3,11]. Results are summarized in Table 3.

We give an example where parallelization does not reduce the parallel
optimization time in any meaningful way. It can be shown that on G' a single island
with constant probability arrives at a solution where the current walk equals
one of the two cycles and the cycle has to be rotated by a distance of Θ(m). If
the migration interval is small enough (depending on the number of islands and
the diameter of the topology), there is further a constant probability that this
solution was spread throughout all islands. As only strictly better immigrants
are considered for inclusion, all islands perform independent random walks. As
the time for completing the random walk is highly concentrated, the expected
time until the first island finds an improvement is still Ω(m⁴/(log μ)).

Theorem 4. Consider the island model with an arbitrary strongly connected
topology T running RLS with jumps on each island. If τ · diam(T) · μ = O(m²)
then the expected number of generations on G' is at least Ω(m⁴/(log μ)).

Using any polynomial number of islands only reduces the expected optimization
time by at most a log-factor. However, in other settings parallelization can help
dramatically. One positive effect of an island model is that islands can make
different decisions on how to extend the current walk. On G' this can make the
difference between reaching the plateau and avoiding it completely.

In the beginning RLS typically evolves a walk on one of the two cycles C and
C'. If v*, the vertex connecting C and C', is included in the current walk, the
walk can either be extended towards the "opposite" cycle or it can move past
v* and close the current cycle. In the former case an Eulerian cycle can be found
easily by adding edges one by one. But in the latter case RLS has closed a cycle
prematurely and it now has to rotate the walk to be able to include edges from
the opposite cycle. This rotation dominates the expected running time.

Parallelization can help to make the right decision through independent
evolution. If islands are run in parallel and if they evolve independently for at least
τ ≥ m³ generations, they tend to make independent decisions. This includes the
case where no migration happens at all. The islands that have made the good
decisions finish first, in expected time Θ(m³). The remaining islands need Θ(m⁴)
steps in expectation. The probability of making a good decision is at least 2/3
as a walk ending at v* can be extended by either of 3 edges, two of which lead to
the opposite cycle; all 3 edges have the same probability of being added. Hence,
the probability that all μ islands decide badly, so that a rotation (and time Θ(m⁴))
is needed, is at most 3^{-μ}.

Theorem 5. The island model running RLS on μ ≤ poly(m) islands, τ ≥ m³,
and an arbitrary topology optimizes G' in expected O(m³ + 3^{-μ} · m⁴) generations.

The choice μ = log_3 m leads to an expected parallel time of O(m³). Compared
to the Θ(m⁴) bound for a single island, the time drops by a factor of order m = 3^μ
with only μ islands. This is a superlinear and, technically, even an exponential
speedup. This is the first proof that island models can lead to a superlinear
speedup on problems from combinatorial optimization.
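
The trade-off in Theorem 5 can be checked with a few lines of arithmetic. The sketch below evaluates the order of the bound (ignoring the constants hidden in the O-notation) and finds the smallest number of islands for which the term 3^{-μ} · m⁴ no longer exceeds m³. Both function names are hypothetical.

```python
def rare_migration_bound(m, mu):
    """Order of the bound m^3 + 3^(-mu) * m^4 from Theorem 5 (constants ignored)."""
    return m ** 3 + 3.0 ** (-mu) * m ** 4

def islands_for_cubic_time(m):
    """Smallest integer mu with 3^(-mu) * m^4 <= m^3, i.e. mu >= log_3 m."""
    mu = 0
    while 3 ** mu < m:
        mu += 1
    return mu
```

For m = 81 edges, four islands suffice (3⁴ = 81), and the bound becomes 2 · 81³, whereas a single island is left with the rotation term of order 81⁴.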

The above result generalizes to instances where at v* more than two cycles
come together. On other graphs the probability of not closing a cycle prematurely
is exponentially small [3] and no speedups are possible. Details are omitted.

The results seen so far can be improved by restricting the mutation operator.
The length of the current walk can only be increased in RLS if an edge
jumps to either position 1 or ℓ + 1. Choosing the second parameter uniformly
from {1, ℓ + 1} (called a symmetric restriction) decreases all time bounds by a
factor of Θ(m) (see Table 3). The authors of [3] introduced an asymmetric jump
operator where the second parameter is fixed to 1, i.e., all edges are prepended
to the current walk. This innocent-looking modification makes rotating cycles
much easier as rotations are only possible in one direction. This removes the
random-walk behavior, implying that the performance difference between frequent
and rare migrations breaks down. It follows from Theorem 2 in [5] that
then the island model running RLS with this operator finds an optimum on G'
in expected O(m²) generations, for any topology.

7 Conclusions 

Considering speedups of island models has led to a surprising richness of results.
For sorting, linear speedups are possible, but the guarantees for parallelizability
significantly depend on the measure of sortedness and the topology. The
single-source shortest paths problem also allows for linear speedups, the maximum
number of islands depending on the topology and the mutation operator. For
Eulerian cycles results are inconclusive. Parallelization does not always help to
speed up search on plateaus. However, it can help in some cases by avoiding
plateaus if decisions where to extend the current edge walk are made correctly.
Also the parameters of the island model play a key role. On the same natural
instance G', speedups vary grossly: from exponential speedups (for up to
μ = O(log m) islands) with rare or no migrations to at most logarithmic speedups
if migration is used too frequently and diversity is lost. Speedups also vary with
the mutation operator.

Acknowledgments: The second author was supported by EPSRC grant 
EP/D052785/1. 
A Appendix

The appendix contains all proofs that were omitted in the main part of the
paper. In the following T^par denotes the parallel optimization time and H(n)
denotes the n-th harmonic sum.



A.1 Proofs from Section 3



Proof of Theorem 1. For the third claim, observe that in the complete topology
all islands will be on the current best fitness level after migration. Then the
probability of reaching a higher fitness level is at least 1 - (1 - s_i)^μ and the
expected time is bounded by

$$\frac{1}{1 - (1 - s_i)^{\mu}} \le 1 + \frac{1}{\mu} \cdot \frac{1}{s_i}, \tag{1}$$

where the inequality was proposed by Jon Rowe (personal communication, 2011);
it can be proven by a simple induction.
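
Inequality (1) can be spot-checked numerically. The following sketch compares both sides on a small grid of values; it is a sanity check under the stated formula, not a substitute for the induction proof, and the function names are hypothetical.

```python
def waiting_time_exact(s, mu):
    """Expected waiting time 1 / (1 - (1 - s)^mu) when mu islands each succeed with probability s."""
    return 1.0 / (1.0 - (1.0 - s) ** mu)

def waiting_time_bound(s, mu):
    """Right-hand side of inequality (1): 1 + 1 / (mu * s)."""
    return 1.0 + 1.0 / (mu * s)

def check_inequality(probabilities, max_mu):
    """Return True if (1) holds for all given s and all mu = 1..max_mu."""
    return all(
        waiting_time_exact(s, mu) <= waiting_time_bound(s, mu) + 1e-9
        for s in probabilities
        for mu in range(1, max_mu + 1)
    )
```

The bound reads naturally: one generation is always needed, and beyond that the μ islands share the single-island waiting time 1/s_i.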

For the first bound, we claim that for every integer k ≤ μ the expected
time until fitness level i is left is bounded by $k + \frac{1}{k} \cdot \frac{1}{s_i}$. The reason is that
after k - 1 generations at least k islands will be on the current best fitness
level. This holds for the unidirectional ring and, in fact, for arbitrary strongly
connected topologies. Along with (1), this proves the claim. Now, if
$\mu \ge k := s_i^{-1/2}$ (ignoring rounding issues), the expected number of generations on fitness
level i is bounded by $k + \frac{1}{k} \cdot s_i^{-1} = s_i^{-1/2} + s_i^{-1/2}$. If $\mu < s_i^{-1/2}$, we get for
k := μ an upper bound of $\mu + \frac{1}{\mu} \cdot s_i^{-1} \le s_i^{-1/2} + \frac{1}{\mu} \cdot s_i^{-1}$. Together, this proves
the claimed bound.

Likewise, for the second bound after $2(\sqrt{k} - 1)$ iterations at least k islands
will be on the current fitness level as this time is sufficient to cover a rectangular
area of $\sqrt{k} \times \sqrt{k}$ vertices in the topology. The expected time for leaving level i
is thus at most $2\sqrt{k} + \frac{1}{k} \cdot \frac{1}{s_i}$ for all k ≤ μ, again using (1). If $\mu \ge k := s_i^{-2/3}$, this
gives $2 s_i^{-1/3} + s_i^{-1/3} = 3 s_i^{-1/3}$. Otherwise, k := μ yields a bound of

$$2\sqrt{\mu} + \frac{1}{\mu} \cdot \frac{1}{s_i} < 2 s_i^{-1/3} + \frac{1}{\mu} \cdot \frac{1}{s_i}.$$

□



A. 2 Proofs from Section 4 (Sorting) 

Proof of Theorem [H In the proof of Theorem 2 in [TJ] lower bounds for prob- 
abilities of improving the current fitness have been established. For the func- 
tion INV there are m := ( J 2 l ) fitness levels. Using the straightforward partition 
Ai := {x | f(x) = i}, the probability of an improvement on fitness level m — i 
is at least 3i/(2en(n— 1)). Applying the first result from Theorem[T]we get the 
following upper bound for the parallel expected time of an island model with an 
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arbitrary topology. Using Y^Li 1/* 1 ^ 2 < Jq" di = 2m 1 / 2 , 

rn — l 1 m— 1 1 

£(TP-)<2£— + 

i=0 S i P i=0 1 

El 71 ^-v 1 

a/2 + ~ Z^ 7 



i=l ^ i=l 

< — - • jn 1 / 2 + — • H{m) 

( 2 n 2 log n 

= CM n H 

V M 

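For HAM, LAS, and EXC only fitness values in $\{0, \dots, n\}$ are possible [H]. The probability for the (1+1) EA making an improvement is bounded from below by $s_i \ge i/(en(n-1)) \ge i/(en^2)$ for HAM and EXC and by $s_i \ge i/(2en(n-1)) \ge i/(2en^2)$ for LAS. We thus get $s_i \ge \alpha i/n^2$ in all cases when choosing $\alpha \in \{1/e, 1/(2e)\}$ appropriately. For ring graphs Theorem 1 results in the bound, using $\sum_{i=1}^{n-1} 1/i^{1/2} \le \int_0^n 1/i^{1/2} \, di = 2n^{1/2}$,

\[
\begin{aligned}
E(T^{\mathrm{par}}) &\le 2 \sum_{i=1}^{n-1} \frac{1}{s_i^{1/2}} + \frac{1}{\mu} \sum_{i=1}^{n-1} \frac{1}{s_i} \\
&\le 2 \sum_{i=1}^{n-1} \frac{n}{\alpha^{1/2}\, i^{1/2}} + \frac{1}{\mu} \sum_{i=1}^{n-1} \frac{n^2}{\alpha i} \\
&\le \frac{4n}{\alpha^{1/2}} \cdot n^{1/2} + \frac{n^2}{\alpha\mu} \cdot H(n) \\
&= O\left(n^{3/2} + \frac{n^2 \log n}{\mu}\right).
\end{aligned}
\]

For torus or grid graphs we get, using $\sum_{i=1}^{n-1} 1/i^{1/3} \le \int_0^n 1/i^{1/3} \, di = 1.5 \cdot n^{2/3}$,

\[
\begin{aligned}
E(T^{\mathrm{par}}) &\le 3 \sum_{i=1}^{n-1} \frac{1}{s_i^{1/3}} + \frac{1}{\mu} \sum_{i=1}^{n-1} \frac{1}{s_i} \\
&\le 3 \sum_{i=1}^{n-1} \frac{n^{2/3}}{\alpha^{1/3}\, i^{1/3}} + \frac{1}{\mu} \sum_{i=1}^{n-1} \frac{n^2}{\alpha i} \\
&\le \frac{4.5\, n^{2/3}}{\alpha^{1/3}} \cdot n^{2/3} + \frac{n^2}{\alpha\mu} \cdot H(n) \\
&= O\left(n^{4/3} + \frac{n^2 \log n}{\mu}\right).
\end{aligned}
\]

Finally, for $K_\mu$ the result is

\[
E(T^{\mathrm{par}}) \le n + \frac{1}{\mu} \sum_{i=1}^{n-1} \frac{1}{s_i} \le n + \frac{n^2}{\alpha\mu} \sum_{i=1}^{n-1} \frac{1}{i} = O\left(n + \frac{n^2 \log n}{\mu}\right).
\]

□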
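
The three bounds used in this proof are easy to evaluate numerically. The following is a minimal sketch, not part of the original analysis: the function names `parallel_time_bounds` and `ham_levels` are hypothetical, the success probabilities are taken as $s_i = \alpha i/n^2$ for $i = 1, \dots, n-1$, and the complete-graph bound is assumed to be the number of non-optimal levels plus the shared term $\frac{1}{\mu}\sum_i 1/s_i$.

```python
import math

def parallel_time_bounds(s, mu):
    """Evaluate the ring, torus and complete-graph upper bounds on E(T^par).

    s  : success probabilities s_i of the non-optimal fitness levels
    mu : number of islands
    """
    shared = sum(1.0 / si for si in s) / mu            # (1/mu) * sum_i 1/s_i
    ring = 2 * sum(si ** -0.5 for si in s) + shared    # 2 * sum_i s_i^(-1/2) + shared
    torus = 3 * sum(si ** (-1 / 3) for si in s) + shared
    complete = len(s) + shared                         # number of levels (at most n) + shared
    return {"ring": ring, "torus": torus, "complete": complete}

def ham_levels(n, alpha=1 / math.e):
    # Assumed lower bounds s_i = alpha * i / n^2 for i = 1, ..., n-1.
    return [alpha * i / n**2 for i in range(1, n)]

n, mu = 100, 16
bounds = parallel_time_bounds(ham_levels(n), mu)
for name, value in bounds.items():
    print(f"{name:8s} {value:12.1f}")
```

Increasing `mu` shrinks only the shared term, so the topology-dependent first term eventually dominates; this is the point where adding islands stops paying off.
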
In contrast to HAM, LAS, and EXC, the upper bound for INV used a large
number of $\Theta(n^2)$ fitness levels. This led to rather loose upper bounds for
parallel optimization times. The informed reader might think that grouping fit- 
ness values to create fewer, larger fitness levels could yield better upper bounds 
for INV. A requirement for this to work is that mutations must still improve 
the fitness by so much that the current fitness level is left. However, this is not 
always possible for INV. Consider the permutation 



The difference between its fitness and the fitness of the optimum is $\Theta(n^2)$. This
large value suggests that large improvements are possible. But for the above
permutation, every elementary operation increases the fitness by only $O(1)$. This
indicates that parallelization does not always lead to drastic speedups for INV.

A.3 Proofs from Section 5 (Shortest Paths)

Proof of Theorem 3. We say that a vertex is optimized in case a shortest path
to this vertex has been found. Due to the fitness function, such a shortest path 
can never be lost. As we are dealing with a multiobjective formulation of the 
problem, we cannot directly apply the fitness level method. Instead, we use this 
method for estimating the time until layers of vertices have been optimized. 

As in [14] we define $\ell_i$ as the maximum number of edges on any shortest path from the source $s$ to node $i$. We consider layers of vertices with the same $\ell$-value. Then $n_j := \#\{i \mid \ell_i = j\}$ describes the number of vertices on the $j$-th layer, i.e., vertices where all shortest paths have at most $j$ edges.

Once all layers $1, \dots, j-1$ have been optimized, each vertex $v$ in Layer $j$ becomes optimized if a predecessor $w$ on a shortest path is found for $v$. This is because a shortest path from $s$ to $w$ plus the edge $(w, v)$ gives a shortest path to $v$. The probability of setting a correct predecessor for $v$ and not changing any other predecessor is at least $1/(en^2)$. If $i$ vertices in Layer $j$ are not optimized yet, the probability of increasing the number of optimized vertices is at least $s_i := i/(en^2)$. Applying the fitness-level method (Theorem 1) for each layer and noting there are at most $\ell := \max\{j \mid n_j > 0\}$ layers yields the following. For the ring graph or any other strongly connected topology



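\[
\begin{aligned}
E(T^{\mathrm{par}}) &\le \sum_{j=1}^{\ell} \left( 2 \sum_{i=1}^{n_j} \frac{1}{s_i^{1/2}} + \frac{1}{\mu} \sum_{i=1}^{n_j} \frac{1}{s_i} \right) \\
&= 2e^{1/2} n \sum_{j=1}^{\ell} \sum_{i=1}^{n_j} \frac{1}{i^{1/2}} + \frac{en^2}{\mu} \sum_{j=1}^{\ell} \sum_{i=1}^{n_j} \frac{1}{i} \\
&\le 4e^{1/2} n \sum_{j=1}^{\ell} n_j^{1/2} + \frac{en^2}{\mu} \sum_{j=1}^{\ell} \left(\ln(n_j) + 1\right).
\end{aligned}
\]

As $\sum_{j=1}^{\ell} n_j = n$ and both functions $\sqrt{x}$ and $\ln(x)$ are concave, the worst case for both $\sum$-terms is attained for $n_1 = \dots = n_\ell = n/\ell$. This yields

\[
E(T^{\mathrm{par}}) \le 4e^{1/2}\, n^{3/2} \ell^{1/2} + \frac{en^2 \ell \ln(en/\ell)}{\mu}.
\]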

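For the torus we get, using $\sum_{i=1}^{n_j} 1/i^{1/3} \le \int_0^{n_j} 1/i^{1/3} \, di = 1.5 \cdot n_j^{2/3}$,

\[
\begin{aligned}
E(T^{\mathrm{par}}) &\le \sum_{j=1}^{\ell} 3 \sum_{i=1}^{n_j} \frac{1}{s_i^{1/3}} + \sum_{j=1}^{\ell} \frac{1}{\mu} \sum_{i=1}^{n_j} \frac{1}{s_i} \\
&= 3e^{1/3} n^{2/3} \sum_{j=1}^{\ell} \sum_{i=1}^{n_j} \frac{1}{i^{1/3}} + \frac{en^2}{\mu} \sum_{j=1}^{\ell} \sum_{i=1}^{n_j} \frac{1}{i} \\
&\le 4.5\, e^{1/3} n^{2/3} \sum_{j=1}^{\ell} n_j^{2/3} + \frac{en^2}{\mu} \sum_{j=1}^{\ell} \left(\ln(n_j) + 1\right) \\
&\le 4.5\, e^{1/3} n^{4/3} \ell^{1/3} + \frac{en^2 \ell \ln(en/\ell)}{\mu}.
\end{aligned}
\]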

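For the complete graph we get

\[
E(T^{\mathrm{par}}) \le \sum_{j=1}^{\ell} n_j + \sum_{j=1}^{\ell} \frac{1}{\mu} \sum_{i=1}^{n_j} \frac{1}{s_i} \le n + \frac{en^2 \ell \ln(en/\ell)}{\mu}.
\]

□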
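
The concavity step can be checked numerically for the ring bound. The sketch below is an illustration under stated assumptions rather than part of the proof: it takes the per-layer expression to be $4e^{1/2} n \sum_j n_j^{1/2} + \frac{en^2}{\mu} \sum_j (\ln(n_j) + 1)$, which is what $s_i = i/(en^2)$ and the ring bound of Theorem 1 give, and compares it with the balanced closed form $4e^{1/2} n^{3/2} \ell^{1/2} + en^2 \ell \ln(en/\ell)/\mu$. Both function names are hypothetical.

```python
import math

def ring_bound_from_layers(layers, mu):
    """Ring bound evaluated for concrete layer sizes n_1, ..., n_l (all >= 1)."""
    n = sum(layers)
    first = 4 * math.sqrt(math.e) * n * sum(math.sqrt(nj) for nj in layers)
    second = math.e * n**2 / mu * sum(math.log(nj) + 1 for nj in layers)
    return first + second

def ring_bound_balanced(n, ell, mu):
    """Closed form obtained when every layer has size n / ell."""
    first = 4 * math.sqrt(math.e) * n**1.5 * math.sqrt(ell)
    second = math.e * n**2 * ell * math.log(math.e * n / ell) / mu
    return first + second

unbalanced = ring_bound_from_layers([1, 5, 10, 4], mu=8)
balanced = ring_bound_balanced(n=20, ell=4, mu=8)
print(unbalanced <= balanced)
```

By Jensen's inequality the unbalanced value can never exceed the balanced one for the same $n$ and $\ell$, which is why the closed form is a valid worst case.
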



A.4 Proofs from Section 6 (Eulerian Cycles)

We first prove Theorem 5 and then reuse some of the proof arguments for proving Theorem 4.

Proof of Theorem 5. For every island, unless the current walk has formed a cycle, there is always at least one jump operation that increases the length of the walk. The probability of such a jump is at least $1/m^2$. By Chernoff bounds, with probability $1 - e^{-\Omega(m)}$, after $\tau \ge m^3$ generations an island has either at some point reached a walk that forms a cycle of length $m/2$, or its current walk is strictly longer than $m/2$.

Assume that the above condition holds for all islands. We estimate the prob- 
ability that all islands have reached a cycle of m/2 edges. Due to independence, 
we can focus on RLS running on a single island. 

One important observation for RLS is that once RLS has discovered a walk $(e_1, \dots, e_\ell)$ of length $\ell \ge 2$ the current walk will always contain the edges $e_1, \dots, e_\ell$. So, after the first walk of length at least 2 is discovered, it will typically be extended by prepending a matching edge $e_0$ or appending a matching edge $e_{\ell+1}$. (In the latter case the walk can grow by more than one edge in case the sequence of edges happens to continue with a proper edge and so on.) Once the current walk contains edges from both $C$ and $C'$, the global optimum can be found easily as there is always at least one jump operation extending the current walk. The expected remaining time until an Eulerian cycle is constructed is bounded by $m^3$.

Assume pessimistically that the first walk of length at least 2 lies completely in $C$ (say). Consider the first point of time where $v^*$ becomes part of the walk. Suppose that the walk starts with $v^*$ and that the walk is not equal to $C$. Then there are three edges that can jump to the first position of the current walk: the edge in $C$ incident to the first edge of the walk and two edges in $C'$ that contain $v^*$. As all jumps are equally likely, the probability that a jump adds one of the edges in $C'$ before the edge from $C$ is added is 2/3. Symmetric arguments apply if the walk ends with $v^*$. There is one caveat, though. The jump that has added the edge of the current walk leading to $v^*$ can have added further edges. This would mean that the current walk has already extended past $v^*$, depending on the edges following in the edge sequence. However, all three mentioned edges have so far been symmetric to the algorithm in the sense that their order has not had an impact on the fitness. Therefore, each of these is equally likely to be in the position of the next edge in the edge sequence. So we again have a probability of 2/3 that the walk has been extended towards $C'$.

The probability that at least one island makes the right decision is $1 - 3^{-\mu}$. If this happens, the expected remaining optimization time is $O(m^3)$ as shown above. If this does not happen, we resort to the general upper bound $O(m^4)$ by Neumann [11] for the time until an improvement is found. This proves the claimed bound $O(m^3 + 3^{-\mu} \cdot m^4)$. □
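
To see how quickly the $3^{-\mu} \cdot m^4$ term vanishes, one can tabulate the bound for a few island counts. This is a rough sketch only: the constants hidden in the $O$-notation are set to 1 as an assumption, and the helper names `theorem5_bound` and `islands_needed` are hypothetical.

```python
def theorem5_bound(m, mu):
    """m^3 + 3^(-mu) * m^4 with all hidden constants set to 1 (an assumption)."""
    return m**3 + 3.0 ** (-mu) * m**4

def islands_needed(m):
    """Smallest mu with 3^(-mu) * m^4 <= m^3, i.e. the smallest mu with 3^mu >= m."""
    mu = 0
    while 3**mu < m:
        mu += 1
    return mu

m = 100
for mu in (1, 2, 4, islands_needed(m)):
    print(mu, theorem5_bound(m, mu))
```

With $\mu \ge \log_3 m$ islands the second term is at most $m^3$, so under these assumptions a logarithmic number of islands already brings the bound down to the order of $m^3$.
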

In order to prove Theorem 4 we first need the following lemma about the concentration of hitting times for fair random walks on integers. It follows using standard Chernoff bounds.

Lemma 1. For the fair random walk on $\mathbb{Z}$, starting in state 0, define $T(k)$, $k \in \mathbb{N}$, as the first hitting time of a state in $\{-k, +k\}$. We have $\Pr(T(k) = t) \le 2e^{-k^2/t}$ if $t \ge 2k$ and $\Pr(T(k) = t) \le 2(e/4)^k$ if $t < 2k$.

Consequently, $\Pr(T(k) \le t) \le 2t(e/4)^k$ if $t < 2k$ and $\Pr(T(k) \le t) \le 4k(e/4)^k + 2t e^{-k^2/t}$ if $t \ge 2k$.

Proof. As the claimed bounds for $\Pr(T(k) = t)$ are non-decreasing with $t$, the second statement follows from the first one and the union bound.

The proof of the first statement is a simple application of Chernoff bounds. Let $X$ be the random number of steps among the first $t$ iterations of the random walk where the current state is increased. Clearly, $E(X) = t/2$ and one of the two target states is reached in $t$ steps if and only if $X = t/2 + k$ or $X = t/2 - k$. The probabilities for the last two events are equal, hence $\Pr(T(k) = t) = 2\Pr(X = t/2 + k)$ and we only need to estimate the last probability.
We have $t/2 + k = (1 + \delta) \cdot E(X)$ for $\delta = 2k/t$. If $t \ge 2k$ we use a well-known Chernoff bound for $0 < \delta \le 1$ and have

\[
\Pr(X = t/2 + k) \le \Pr(X \ge t/2 + k) \le e^{-t/2 \cdot (2k/t)^2/2} = e^{-k^2/t}.
\]

For $t < 2k$ we have

\[
\begin{aligned}
\Pr(X = t/2 + k) &\le \Pr(X \ge t/2 + k) \\
&\le \left(\frac{e^{\delta}}{(1+\delta)^{1+\delta}}\right)^{E(X)} \\
&= \left(\frac{e^{2k/t}}{(1 + 2k/t)^{1 + 2k/t}}\right)^{t/2} \\
&= \left(\frac{e}{(1 + 2k/t)^{t/(2k) + 1}}\right)^{k} \le (e/4)^k
\end{aligned}
\]

as $2k/t > 1$ and the function $(1 + x)^{1/x + 1}$ is monotonically increasing for $x \ge 1$. □
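
To get a feel for the hitting time $T(k)$ from Lemma 1, its distribution can be tabulated exactly for small $k$ by dynamic programming over the interior states $-k+1, \dots, k-1$. The sketch below is illustrative only and is not used in the proofs; the function name `hitting_time_distribution` and the truncation parameter `horizon` are assumptions. As a sanity check it compares the mean with $k^2$, a standard fact about symmetric random walks that is not taken from the text.

```python
def hitting_time_distribution(k, horizon):
    """Return p with p[t] = Pr(T(k) = t) for t = 0, ..., horizon.

    T(k) is the first time a fair +-1 random walk started at 0 hits -k or +k.
    """
    width = 2 * k - 1              # interior states -k+1, ..., k-1
    alive = [0.0] * width          # alive[x + k - 1]: mass at x, not yet absorbed
    alive[k - 1] = 1.0             # the walk starts in state 0
    p = [0.0] * (horizon + 1)
    for t in range(1, horizon + 1):
        nxt = [0.0] * width
        absorbed = 0.0
        for idx, mass in enumerate(alive):
            if mass == 0.0:
                continue
            for step in (-1, 1):
                j = idx + step
                if 0 <= j < width:
                    nxt[j] += 0.5 * mass
                else:              # stepped onto -k or +k
                    absorbed += 0.5 * mass
        p[t] = absorbed
        alive = nxt
    return p

k = 5
p = hitting_time_distribution(k, horizon=2000)
mean = sum(t * pt for t, pt in enumerate(p))
print(f"total mass {sum(p):.6f}, mean {mean:.3f}, k^2 = {k * k}")
```

The walk cannot reach distance $k$ in fewer than $k$ steps, and typical hitting times are of order $k^2$; this is the reason a rotation by distance $\Theta(m)$ costs on the order of $m^2$ relevant steps.
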

Proof of Theorem 4. The proof consists of two parts. We first prove that with
constant probability the island model will reach a state where all islands need
to rotate a cycle of length $m/2$ by a distance of $\Theta(m)$. As all islands have the
same fitness, we can safely ignore migration until the first island has found an 
improvement after rotating the cycle. The time it takes to get there will establish 
the lower bound. 

Consider the first point of time $t^*$ where an island extends its walk past $v^*$. We know by the proof of Theorem 5 that there is a chance of 1/3 that the walk will continue in the same cycle. The probability that during the next $\tau \cdot \mathrm{diam}(T)$ generations following $t^*$ no island makes a further improvement, and no other island makes a simultaneous improvement at time $t^*$, is at least

\[
\left(1 - \frac{6}{m(m-1)}\right)^{(\tau\, \mathrm{diam}(T) + 1)\mu} \ge \Omega(1)
\]

since there are always at most 6 improving jump operations and $\tau\, \mathrm{diam}(T)\, \mu = O(m^2)$. After this time all islands will have been taken over by the same solution. Hence, we have a probability of $\Omega(1)$ that one island extends its walk past $v^*$, stays in the same cycle, and communicates this superior solution to all other islands. This means that the present edges of the walk will be maintained on each island until the first island has closed a cycle of $m/2$ edges.

By the same argument, again with probability $\Omega(1)$ and independently from the previous events, we have that the island that first closes this cycle is the only island where an improvement happens. Again this solution takes over all islands. Now we use the following argument on symmetry. So far all islands have behaved as if the instance consisted only of $C$. Due to the perfect symmetry of the subgraph induced by $C$, each vertex is equally likely to be the start and end vertex of a walk covering $C$. With probability at least 1/2, again independently from previous events, we have that this vertex has distance at least $m/8$ from $v^*$ on the island that takes over the system. Then a rotation of the walk by a distance of $\Theta(m)$ is necessary for a further improvement.

Following Neumann [11], the only accepted operations for all islands are
rotations of the cycle, unless it starts and ends in v* . As no direct fitness im- 
provements are possible and only strictly better immigrants are accepted, all 
islands evolve independently until an island finds an improvement. 

A step rotating the current cycle is called a relevant step. It has probability $2/(m(m-1))$ and the probability of having more than $t := bm^2/(\ln \mu)$ relevant steps, for some constant $b > 0$ specified later, in $bm^3(m-1)/(3 \ln \mu)$ generations is $e^{-\Omega(m)}$ by Chernoff bounds. Assume in the following that each island makes at most $t$ relevant steps, which happens with probability at least $1 - \mu \cdot e^{-\Omega(m)}$.
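
As a worked check of the scale involved, using only the quantities named in this proof: with a relevant-step probability of $2/(m(m-1))$ per generation, the expected number of relevant steps on one island within $bm^3(m-1)/(3 \ln \mu)$ generations is

\[
\frac{2}{m(m-1)} \cdot \frac{b m^3 (m-1)}{3 \ln \mu} = \frac{2 b m^2}{3 \ln \mu},
\]

which is two thirds of the threshold $bm^2/(\ln \mu)$. Exceeding the threshold therefore means exceeding the expectation by a factor of $3/2$, which is the kind of constant-factor deviation that Chernoff bounds control.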

A clockwise rotation has the same probability as a counterclockwise rotation. If we map the possible positions of the start/end of the cycle to $\mathbb{Z}$ such that after takeover each island starts at 0, each island performs a fair random walk. This random walk has to cover a distance of at least $m/8$ from 0 in order to reach $v^*$. We apply Lemma 1 with $k := m/8$. The probability of reaching this goal in $t := bm^2/(\ln \mu)$ steps, $b > 0$ an appropriate constant, is at most $1/(2\mu)$. By the union bound, the probability that any island has reached this goal is at most 1/2. So, with probability at least $1/2 - \mu \cdot e^{-\Omega(m)}$ the island model has not found an improvement after $bm^3(m-1)/(3 \ln \mu) = \Omega(m^4/(\log \mu))$ generations. As $1/2 - \mu \cdot e^{-\Omega(m)} = \Omega(1)$, this establishes the claimed lower bound. □

Using symmetrically restricted jumps decreases the number of possible jump
operations from $m(m-1)$ to only $2m$. Recall that for rotating a cycle, only two
jump operations are possible. These jumps are still feasible with the restricted
operator and they now have a higher probability of $1/(2m)$ each. This raises the
probability of making a relevant step to $1/m$. By exactly the same reasoning
as above, only changing the period of $bm^3(m-1)/(3 \ln \mu)$ generations to
$bm^3/(3 \ln \mu)$ generations, we arrive at the results shown in the second line of
Table El 



